Incorporation of protein binding effects into likelihood ratio test for exome sequencing data

Authors

  • Dongni Zhang
  • Hongzhu Cui
  • Dmitry Korkin
  • Zheyang Wu
Abstract

Statistical association studies are an important tool for detecting novel disease genes. For sequencing data, however, association studies suffer from low power because sample sizes are relatively small and the variants are rare. Incorporating biological information that reflects the disease mechanism is likely to strengthen the association evidence for disease genes, and thus increase the power of association studies. In this paper, we annotate non-synonymous single-nucleotide variants (SNVs) according to protein binding sites (BSs) by using a more accurate BS prediction method. We then incorporate this information into the association study through a likelihood ratio test (LRT) framework based on a weighted burden score of SNVs. The strategy is applied to the Genetic Analysis Workshop 19 exome-sequencing data to detect novel genes associated with hypertension. The SNV-weighting LRT idea is empirically verified on the simulated phenotypes (336 cases and 1607 controls), and the weights based on BS annotation are applied to the real phenotypes (394 cases and 1457 controls). This strategy of weighting by prior information on protein functional sites is shown to be superior to the unweighted LRT and serves as a good complement to existing association tests. Several putative genes are reported; some of them are functionally related to hypertension according to previous evidence in the literature.
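The abstract does not give the model behind the weighted-burden LRT, so the following is only a minimal sketch of the general idea under stated assumptions: a logistic regression of case/control status on a single weighted burden score, compared with an intercept-only model by a 1-degree-of-freedom chi-square test. The function names `fit_logistic` and `weighted_burden_lrt` and the weight vector are hypothetical and are not taken from the paper.

```python
import math
import numpy as np


def fit_logistic(X, y, n_iter=50, tol=1e-10):
    """Fit a logistic regression by Newton-Raphson.

    Returns the coefficient vector and the maximized log-likelihood.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        grad = X.T @ (y - p)                      # score vector
        hess = X.T @ (X * (p * (1.0 - p))[:, None])  # observed information
        step = np.linalg.solve(hess, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    eta = X @ beta
    loglik = float(np.sum(y * eta - np.logaddexp(0.0, eta)))
    return beta, loglik


def weighted_burden_lrt(genotypes, weights, y):
    """Likelihood ratio test for a weighted burden score (assumed model).

    genotypes: (n_samples, n_snvs) minor-allele counts for one gene
    weights:   (n_snvs,) per-SNV weights, e.g. larger for SNVs annotated
               as lying in a predicted binding site (hypothetical values)
    y:         (n_samples,) 0/1 case-control status
    """
    burden = genotypes @ weights                  # one score per sample
    n = len(y)
    X0 = np.ones((n, 1))                          # null: intercept only
    X1 = np.column_stack([np.ones(n), burden])    # alternative: + burden
    _, ll0 = fit_logistic(X0, y)
    _, ll1 = fit_logistic(X1, y)
    stat = max(0.0, 2.0 * (ll1 - ll0))
    # Upper tail of a chi-square with 1 degree of freedom.
    pval = math.erfc(math.sqrt(stat / 2.0))
    return stat, pval
```

Setting all weights to 1 gives an unweighted burden test under the same assumed model, which is one way to compare a weighted and an unweighted LRT on the same gene.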


Similar resources

Prediction of 3D protein Structure based on Mutation of AKAP3 and PLOD3 Gene in Case of Non-Obstructive Azoospermia

Background: The present study has been designed with the aim of evaluating A-kinase anchoring proteins 3 (AKAP3) and Procollagen-Lysine, 2-Oxoglutarate 5-Dioxygenase 3 (PLOD3) gene mutations and prediction of 3D protein structure for ligand binding activity in the cases of non-obstructive azoospermic male. Materials and Methods: Clinically diagnosed cases of non-obstructive azoos...


Whole exome sequencing revealed a novel dystrophin-related protein-2 (DRP2) deletion in an Iranian family with symptoms of polyneuropathy

Objective(s): Charcot-Marie-Tooth disease (CMT) is one of the main inherited causes of motor and sensory neuropathies with variable expressivity and age of onset. Although more than 70 genes have been identified for CMT, more studies are needed to discover other genes involved in CMT. Introduction of whole exome sequencing (WES) to capture all the exons may help to fin...


Rare Variants Detection with Kernel Machine Learning Based on Likelihood Ratio Test

This paper mainly utilizes likelihood-based tests to detect rare variants associated with a continuous phenotype under the framework of kernel machine learning. Both the likelihood ratio test (LRT) and the restricted likelihood ratio test (ReLRT) are investigated. The relationship between the kernel machine learning and the mixed effects model is discussed. By using the eigenvalue representatio...


The weighting is the hardest part: on the behavior of the likelihood ratio test and score test under weight misspecification in rare variant association studies

Rare variant association studies are at a critical inflexion point with the increasing availability of exome-sequencing data. A popular test of association is the sequence kernel association test (SKAT). Weights are embedded within SKAT to reflect the hypothesized contribution of the variants to the trait variance. Correct weighting is expected to boost power, and yet the correct weights are ge...


Whole Exome Sequencing for Mutation Screening in Hemophagocytic Lymphohistiocytosis

Background: Hemophagocytic lymphohistiocytosis (HLH) is an immune system disorder characterized by uncontrolled hyper-inflammation owing to hypercytokinemia from the activated but ineffective cytotoxic cells. Establishing a correct diagnosis for HLH patients due to the similarity of this disease with other conditions like malignant lymphoma and leukemia and similarity between its two forms is dif...



Journal title:

Volume 10  Issue 

Pages  -

Publication date 2016